hep-th/0312262 
AEI-2003-113 



Bulk Witten Indices from D = 10 Yang Mills 

Integrals 

O 

\ T. Fischbacher 

Max-Planck-Institut fur Gravitationsphysik, 
. Albert- Einstein- Institut 

r ^ ' Muhlenberg 1, D-1^76 Potsdam, Germany 

. tf@aei.mpg.de 

m 

CN ' Abstract 



(N 



Values for the bulk Witten indices for D — 10 Yang-Mills integrals for 
regular simple groups of rank 4 and 5 are calculated by employing the 
■ BRST deformation technique by Moore, Nekrasov and Shatashvili. The 

' results cannot be reconciled with the double assumption that the number 

£S| , of normalizable ground states is given by certain simple partition functions 

£N| ' given by Kac and Smilga as well as that the corresponding boundary term 

is always negative. 

m ' 
o 
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1 Introduction 



O 1 Supersymmetric Yang-Mills theories in D dimensions are described by an action 

D , of the form 

S = Jd D x (- + (!) 

k>( \ where indices a,b, . . . correspond to the adjoint representation of some semisim- 

; i ■ pie compact Lie algebra, fx, v, , . . are spacetime vector indices (spatial indices 

will be denoted by . . .), and a,/?, . . . are spinorial spacetime indices; fur- 
thermore, the nonabelian field strength is formed from the vector potential A a ^ 
via 

F% = d,Al d u A; + gf\ c A\A^ (2) 

f a b c being the gauge group structure constants, and the gauge covariant deriva- 
tive is given by 

Df^ = S ab d^+gf cd A^. (3) 

The equality of the number of physical bosonic and fermionic degrees of freedom 
required by supersymmetry furthermore forces the number of transversal modes 
of the gauge field A a , D — 2, to be a power of two, as is the size of the cor- 
responding irreducible spinor representation of the Lorentz algebra. One finds 
that it is in fact possible here to implement supersymmetry in D = 3, 4, 6, 10. By 
truncation to configurations that have no spatial dependency (i.e. dimensional 



reduction to zero space dimensions, see e.g. HUE]), one obtains supersymmetric 
quantum mechanics 03 E3 EU with a Hamiltonian 

H = \P?P? + ^fbcafdeaA^AfA^ + ^ fabc^K^A* (4) 
With If = -ijfij . 

The D — 10 case with gauge group SU(N) regained a lot of popularity af- 
ter initial work by de Wit, Hoppe, and Nicolai [7] who showed the relevance 
of this particular model for the description of the eleven-dimensional super- 
membrane (where SU(N) appears as a regularized version of the Lie group 
of area-preserving membrane diffcomorphisms) through the so-called M(atrix) 
Theory Conjecture @j, which states that this Hamiltonian of a system of N 
Z?0-branes of type IIA string theory should give a complete, non-perturbative 
description of the dynamics of M-theory in the light cone frame. 1 . Furthermore, 
these matrix models play an important role in the IKKT model , which may 
provide a non-perturbative description of IIB supcrstring theory. 

Gauge groups of other types are also of interest here, as realizations of this 
model with SO(N) and Sp(2N) symmetry are given by systems of N type-IIA 
Z?0-branes moving in orientifold backgrounds, cf. |12|. 

One question of chief importance is that of normalizable zero energy vacuum 
states of these models; this is difficult to settle as the potentials have flat valleys 
that extend to infinity, and hence, the corresponding ground states are at thresh- 
old. While it is exceedingly difficult to try to explicitly solve the Schrodinger 
equation for these models, it is already of great interest to know the number of 
such normalizable ground states; for example, it is of crucial importance to the 
Matrix Theory conjecture that there is exactly one such state for every N in 
the models derived from SU (N) gauged Af = 1 SYM (which just corresponds 
to a bound state of N D0-branes that appears as a graviton with N units of 
momentum in the compactified direction). While this is widely believed to be 
the case by now, the situation is still much less clear for other gauge groups. 

As is nicely explained in |22|. the number of normalizable ground states is 
given as the low temperature limit of the partition function 

/ dx lim tre"^ ff , (5) 

J f3^oo 

but as the calculation of this quantity seems beyond reach for most systems of 
interest, it appears more promising to try to calculate the Witten index 

Iw =n B -n F = dx lim tr ( - l) F e" 0H , (6) 

J /3->oo 

instead (where F is the fermion number), as this should also give the number 
of normalizable ground states. In case of a discrete spectrum, this partition 
function would be (3- independent, so we could as well take the limit /3 — ► 0, 

1 See, e.g. 1261 for an overview 
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which is accessible in a perturbative calculation. For a continuous spectrum of 
the Hamilton operator, this does not work in general, since supersymmetry still 
pairs bosonic and fermionic modes, but the spectral density of scattering states 
need not be equal. In this case, the 'boundary' term I\ in the decomposition 

I w =Io + h 

= lim Urn / dx (tr( - if e -PoH(x,x) + f d(3 dptr (- 1) F e~ fm {x,x)\ 

can acquire a nonzero value. While the technique of splitting the integral into 
a 'principal contribution' from the bulk term as well as a 'deficit contribution' 
which takes the form of a boundary term [22] works remarkably well in many 
situations, as one frequently finds that the boundary term is zero even if it 
a priori does not seem to have to be (e.g. [22] EE]), this is not the case in 
the systems at hand. Nevertheless, the boundary term has been calculated for 
SU(2) in [22], and reasons (that are based on the (heuristic) assumption that 
for the calculation of this deficit term the £)0-branes can effectively be treated 
as identical freely propagating particles, as in |2H|) have been given in that 
this deficit term should be 

I?= W {SU{N)) = - ]T — 2 (8) 

m\N,m>l 

for the TV = 1 models derived from D = 10 SU(N) gauged SYM, while one 
expects the value 

I? =\SU(N)) = I?= 6 (SU(N)) = -i- (9) 

for the TV = 4 and TV = 2 models derived from D = 4, resp. D = 6 SU(N) 
gauged SYM. 

Employing the mass deformation method that has been developed in |271 
I21j . Kac and Smilga 15 showed via group-theoretical means that under the 
hypothesis that no large mass bound state becomes non-normalizable as the 
zero-mass limit is taken, the number of normalizable ground states should be 
given by certain simple partition functions: 
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Independent arguments that lead to the same SO(N) and Sp(2N) multiplic- 
ities have been presented in ^2] which are based on an analysis of the Hilbert 
space of a chiral fermion that is constructed from DO branes at an orientifold 
singularity. 

The bulk term Jo is - at least in principle - independently accessible via 
a generalization [25] of the BRST deformation method which was devised by 
Moore, Nekrasov, and Shatashvili ^Uj m order to greatly simplify the calcu- 
lation of the corresponding partition functions by adding terms to the action 
which break all but one supersymmetry (so that the partition functions do not 
change) 2 . Concerning the boundary term, the method of m has been general- 
ized in J5| to other gauge groups, but it was found that the expected Witten 
index I w = Jo + 1\ could not be obtained that way indicating a failure of 
the assumption of the validity of the free Hamiltonian approach used in 
Nevertheless, for all the D = 4,6, 10 cases investigated previously, the expected 
vacuum degeneracies support the hypothesis [2SJ that the boundary term is 
(with the possible exception of Jf >=10 (G , 2)) always a small number in the inter- 
val [— 1; 0], and hence a prediction of the Witten index should be possible from 
the bulk index alone. 

A first evaluation of Iq =1 ° for special orthogonal and symplectic groups 
employing the Moore method has been performed in |25j : there as well as in 
further work on I® =4 [201 > the algebraic bulkiness of bulk Witten index calcu- 
lations was pointed out, and indeed, even for the Jo° =4 case, going to ranks far 
beyond 3 already required specialized term manipulation code to be written. 
In the following, we want to review the operational issues of the application 
of the Moore method to the even far more involved D = 10 case and present 
techniques (some conservative, some speculative) that allow the calculation to 
be taken to regular simple groups of rank 4 and 5, and also give arguments that 
such a direct approach is barely feasible for simple groups of rank > 6 employing 
currently available computer technology. 

2 It was verified numerically in 1161 for models obtained from D = 4 SYM with low-rank 
gauge groups that this method seemingly also works for other groups besides SU (N) 
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2 Computational aspects 



The heat kernel calculation 2|| of the bulk contribution to I w requires evalua- 
tion of the partition function 3 



1 



T — 7 J 

~ ~~Tg G 



„dimG D i™ J\f 



i _ i 

2? 



x ex P ( 7^™ ^ A v ] + —Tr^jr^X^, rf,p] 



(11) 



„dimG D dX 0, / 1 \ 

= j II n^^PfH/^r-X^exp^Tr^,^]^,^ 

where the fermionic degrees of freedom have been integrated out, yielding a 
homogeneous Pfaffian, cf. ^7]. These partition functions Z^q, which come 
from the reduction of SYM to zero dimensions (see e.g. 0) have been dubbed 
'Yang-Mills integrals'. The factor T G that relates it to the bulk Witten index 
is basically the effective gauge group volume, see [25] for details. 

The BRST deformation technique of greatly simplifies this to the calcu- 
lation of the integral [25] 



^=10 (G) _ \ Z <-: ( {E > +E-,UE 2 + E,)(E, E] ) 

r 

n 



\W G \ V E1E2E3E4 
dx k A G (0,x)A G (S 1 + E 2l x)A G {E 2 + E 3) x)A G (E 3 + E u x) 



~ £=i 2wi A G (E 1 ,x)A G (E 2 ,x)A G (E 3 ,x)A G (E 4 ,x) (12) 

with 

A G (E,x) = Y[ (x-a-E) 
SeA G 

where r is the Lie group rank, \Z G \/\W G \ is the quotient of the orders of the 
center and the Weyl group (l/(2 r H) for B r , l/(2 r -V!) for C r , l/{2 r - 2 r\) for 
D r , 1/12 for G 2 ; these are the only cases we are concerned with here), i?i,2,3,4 
are auxiliary real quantities with Ej = which the end result will not 
depend on (at least for sufficiently generic values of Ej where no 'accidental' 
merging of poles/zeroes happens). A G is the set of roots of the Lie algebra G. 
Actually, as it stands these integrals do not make much sense, as we have poles 
on the real axis of integration. (Furthermore, they do not fall off fast enough 
towards complex infinity to rigorously justify closing the integration contour.) 
The correct (not yet fully justified, see JUj) interpretation of these integrals 
rather is the following algorithmic one: 

3 Here and in the following, we will use the conventions of 1251 
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1. Successively eliminate all Xk 'integration variables' in the term 

\Z G \ ( (£1 + E 2 )(E 2 + E 3 )(E 3 + £Q N 
\W G \ V E X E 2 E Z E± 

x lim Rcs. + . . . Res x + Mg^^ + ^x 
^ *r A G (E 1 ,x)A G {E 2 ,x) 

A G (£ 2 + £ 3 , x)A G (E 3 + Ex, x) 



(13) 



A G (E 3 ,x)A G (E 4 ,x) 



where the Ek,Xk are all treated as real, and Res+ picks up the residues 
with positive imaginary part only. (All the Xk except the one being inte- 
grated out are treated as real.) 

2. Substitute £ 4 = -E x - E 2 - E 3 

3. Evaluate at generic values of £1,2.3 ■ (Alternatively: simplify to find that 
the result does not depend on £1.2,3-) 

Due to the large number of factors in the denominator that occur in these 
formulae for all but the smallest simple groups, one has to resort to employing 
clever tricks to simplify the calculation, or massive computer aid (or both). 

While the use of symbolic manipulation programs like Maple or Mathematica 
suggests itself for calculations like the ones at hand, and indeed interesting 
results have been obtained that way the range to which these calculations 
can be carried by employing such systems in a head-on approach is quite limited, 
mostly due to the observation that (unlike the D — 4 case) in the evaluation of 
(B, C, D) n integrals, one seems to generically encounter poles of order (n — 1) 
that require forming (n — 2)th order derivatives of very large products, leading 
to an explosion of the number of terms generated. Nevertheless, it is obviously 
important to try to obtain these values for as many gauge groups as possible in 
order to check the validity of various assumptions that had to be made in the 
calculation of Witten indices, and perhaps even make further conjectures about 
analytical expressions, e.g. as in |2U] for D = 4. 

Even if one cannot do much about combinatoric explosion here, the question 
nevertheless arises, whether - considering the conceptual simplicity of the prob- 
lem - one may be able to go a few steps further by trying to make use of as much 
of the structure of the calculation as possible in a dedicated program. While 
it is frequently possible to outperform general-purpose symbolic algebra pack- 
ages by three orders of magnitude in term complexity with such an approach, 
more than one rabbit has to be pulled out of the hat in order to achieve a per- 
formance gain of one million or more, which we will see to be necessary here. 
As the underlying techniques are of interest for a far larger class of symbolic 
calculations, yet not widely known since there is (to the author's knowledge) 
hardly any literature on the relevant issues, we want to briefly present some of 
the fundamental concepts one should be aware of when taking symbolic algebra 
to its limits. Incidentally, the calculation at hand is an almost perfect example 
to study these techniques. 
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Conventional symbolic algebra frequently (and especially in calculations like 
these) wastes most of its time generating intermediate quantities of very limited 
lifetime in dynamically allocated memory which has to be reclaimed soon after. 
In the following, we want to adhere to the convention to call this allocation of 
dynamic memory which will not explicitly be reclaimed consing . As unneces- 
sary consing forces frequent expensive calculations of reachability graphs of in- 
memory objects (in order to identify rcclaimable space in the garbage collection 
process), it should as a rule of thumb be avoided where possible. Furthermore, 
one should keep in mind that for the presently dominant computer architecture 
the processor's memory interface is an important bottleneck, and hence it makes 
sense to try to find tight memory encodings for those pieces of data on which 
most of the calculation operates so that cache stalls are minimized. 

One particularly striking feature of the calculation at hand is that naively in- 
tegrating out a single variable makes conventional term manipulation programs 
first allocate huge numbers of almost similar terms, which are then checked (in 
an overwhelming number of cases in vain) for possible annihilations. This can be 
avoided by re-structuring the calculation in such a way that instead of generating 
and processing large amounts of individual mostly similar (yet different enough 
that cancellations become quite rare) terms, one and the same term backbone 
is destructively modified to consecutively represent every single new term gen- 
erated and then do further processing on this backbone wherever possible. This 
technique is particularly useful when it comes to processing higher-order deriva- 
tives (where in addition we take care of performing the iterations over places of 
factors where to derive in such a way that re-occurring combinations of deriva- 
tives, as in (fg)" = f"g+fg' + fg»+fg> = f»g + fg» +2 fg> are only generated 
once). 

One particular refinement for the present calculation is that one generally 
can hardly avoid creating new terms by substitution, except in the very last 
step (i.e. integrating out the last variable) where this is indeed feasible. Since 
the last step is also executed most frequently, this almost buys us an extra rank 
for free. (A slightly nontrivial subtlety for the D = 10 calculation here is that 
we have to take care of the possibility of generating factors E\ + E 2 + E 3 + E 4 
in both the numerator as well as the denominator. It may well happen that 
while accumulating the factors of a term, one intermediately encounters more 
such 'powers of zero' in the denominator than in the numerator.) 

As calculations with exact numbers cause some systems to perform excessive 
number consing, it is wise to try to avoid the exact rational number data type 
for all those parts of a calculation where one can use with impunity more limited 
data types that have fast direct hardware support (like 32-bit integers) 5 . For 
our present calculation this means to represent the coefficients of linear func- 
tions as machine integers. (Clearly, one has to take care of proper handling of 

4 Named after the cons (pair), which is the most ubiquitous building block of hierarchical 
data structures in LISP 

5 Actually, most systems providing a dynamic GC need to use a few tag bits to discern 
between immediate and referenced (i.e. consed) values, so we typically only can use 30 bit 
signed machine integers on 32-bit hardware 
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some 'balancing denominators' that are generated by this somewhat artificial 
treatment that requires to take least common multiples at substitutions.) For 
all of the problems that are reachable with present computer hardware, all of 
the substitutions of such terms among themselves will not lead us outside the 
range of these integer machine data types. 

A further property of the calculations under study is that when performed 
in the way described here, a large fraction of the contributions are zero, and 
in many cases, this can be detected somewhat easily without doing costly mul- 
tiplications. Hence, it makes sense to pre-scan a term for being zero before 
processing it wherever applicable. 

It is perhaps one of the less obvious properties of these Yang-Mills integrals 
that one can do much better than to substitute particular values for Ex.. .4 sub- 
ject to the constraint J2j Ej — f° r the final evaluation; by bringing all factors 
to a proper lexicographical normal form (E\ contribution first if present, then 
E2 contribution, then E3 contribution, all after having substituted out E4), one 
observes to obtain also the correct final value by replacing every linear term by 
its leading coefficient. 6 While the largest calculations at present cannot be done 
without employing this trick, we do a cross-check using the more conventional 
method wherever this is possible. This is indicated in the last column of our 
table of results. 

As a special refinement, one notes that in the final gathering of powers of 
linear factors, the most frequently encountered numerators and denominators 
are ±1 as well as small powers of 2. Indeed, a further noticeable speedup can be 
achieved by treating these factors special (remembering one overall power of 2 
as well as the resulting sign) in order to avoid unnecessary use of exact rational 
arithmetics. 

As a basis for the implementation, the Objective Caml system |18j appears 
as very appealing, since it is highly portable to a variety of different platforms, 
contains an optimizing compiler that can generate compact standalone binaries, 
allows a very smooth and easy two-way integration of C libraries and code; all 
these qualities are highly desirable here especially since the problem at hand 
suggests itself to massive parallelization, perhaps by making use of donated 
computation time. (While rudimentary parallelization support is present in 
our implementation and has proven its usefulness, we do not implement such a 
large-scale scheme here, mainly because it is expected that one would need a 
prohibitively large number of volunteers to successfully do the next rank.) A 
further advantage may be that Ocaml code generally is perceived as much less 
alien by the uninitiated than LISP code. One further noteworthy issue here 
is the quality of the built-in exact rational number arithmetics, since there are 
all but obvious huge performance differences between various implementations 7 . 
While the implementation present in Ocaml 3.07 is slower than the one of CLISP 

6 One of the more miraculous properties of these integrals is that one also observes to 
obtain the correct final value by replacing the sum in every linear factor by a product, i.e. 
{^Zj CjEj^j (YljCjEj^ ; this was discovered through an intermediate programming error. 

7 For example, Gambit Scheme large fraction arithmetics is roughly by a factor 2500 slower 
than the excellent one present in CLISP, which even well outperforms e.g. that of Maple. 
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by a factor of roughly 300, this is not a big problem, since our techniques to 
reduce the use of fraction arithmetics are powerful enough (at least when mak- 
ing use of the evaluation shortcut described above) to make the amount of time 
spent in arithmetics comparable to the time spent in other parts of the pro- 
gram. CLISP would perhaps hardly be a viable alternative despite its excellent 
arithmetics implementations, since it is only a byte-code interpreter system. As 
there are good reasons to make the code with which these calculations have been 
performed publicly available 8 , it has been included in the arXiv.org preprint 
upload of this work 9 . 

3 Results and discussion 

The following table shows our new as well as the previously known values for the 
bulk contributions to the Witten index. Numerical approximations as well as 
the expected values from the Kac/Smilga hypothesis are also given. Calculation 
times refer to accumulated wall clock times on a stepping-9 Intel Pentium 4 
CPU, 2.4 GHz, Hyperthreading enabled/ocaml 3.07, gec 2.95, Linux kernel 
2.4.24 student computer pool installation. (Values obtained on other systems 
have been re-scaled appropriately.) Boldface indicates new values. 

Some features that deserve special attention here are the particularly large 
values for 50(8) and Sp (10) that would require h < -1, the 50(9) and 50(11) 
values that would require I-y > 0, and the dramatic explosion of calculation time; 
beyond some 'trivial' cases, increasing the rank by one costs a factor (as a rule 
of thumb) of roughly 1000 in CPU time in the interesting regime. Hence, it 
is probably not yet feasible to try to attack rank 6 for the (B, C, D) n groups. 
As the A n bulk Indices are just the expected ones, and as the (B,C,D) n are 
of the right magnitude, with denominators being powers of two (this is not the 
case at all for individual summands), there is good reason to believe in both the 
validity of this approach and its implementation. However, the calculation of 
the O4 index by using explicit values for £4,2,3 seems to fail systematically for 
yet unknown reasons, despite a very careful analysis of the code. (This may be 
related to yet another bug in the ocaml compiler; this assumption is also fueled 
by the observation that part of the Dq calculation causes memory violations 
that should not be possible at all in pure ocaml for a very specific combination 
of processors and optimization flags.) 

8 first, as we are employing some algorithmically nontrivial tricks, one cannot exclude the 
possibility of errors having slipped in, hence it is provided for the sake of reproducibility 
and checkability of all steps; second, there are many more group-theoretical tricks one may 
want to exploit here and incorporate into the existing code; third, one may want to carry 
the calculation further as soon as more computer power becomes accessible, or by employing 
massive parallelization; fourth, it may be instructive to see the detailed implementation of 
some of the techniques described above. 

E http : //www . arxiv . org/e-print/hep-th/0312262 
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2 
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1 
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(As) 
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SO( 8) 
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2048 
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1.5 


10 2 
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ok 
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1 
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ok 
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10- 1 


ok 
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10 2 
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Sp(10) 




583755 
131072 


4.4537 


3 


1.9 


10 6 




SU( 2) 
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4 


1.2500 


1 


3.4 


10- 4 


ok 


SU( 3) 




10 

9 


1.1111 


1 


8.5 


10~ 3 


ok 
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(As) 
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16 


1.3125 


1 


3.7 


10- 1 


ok 


St/( 5) 




26 
25 


1.0400 


1 


4.7 


10 1 


ok 


SU( 6) 


{A,) 


25 
18 


1.3889 


1 


1.4 


10 4 




St/( 7) 


(A 6 ) 


50 
49 


1.0204 


1 


1.1 


10 7 




G 2 




1375 
864 


1.5914 


2 


4.5 


10~ 2 


ok 
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All in all, the new data clearly show that for other gauge groups than SU(N), 
the issue of the number of vacuum states is not well understood, and more work 
has to be put into the determination of Witten indices. In particular, we do not 
have at present a useful theory to calculate the boundary contributions, and it 
is well conceivable that some of the assumptions behind (|10|l may be violated. 
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